A Berry-Esseen bound with applications to vertex 
degree counts in the Erdos-Renyi random graph 
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Abstract 

Applying Stein's method, an inductive technique and size bias couphng yields a 
Berry-Esseen theorem for normal approximation without the usual restriction that the 
coupling be bounded. The theorem is applied to counting the number of vertices in 
the Erdos-Renyi random graph of a given degree. 

1 Introduction 

We present a new Berry-Esseen theorem for sums Y of dependent variables b y corn bining 
Stein's method, size bias couplings, and the inductive technique of Bolthausen (119841 ) origi- 
nally developed for the combinatorial central limit theorem. We apply the theorem to asses 
the accuracy of the normal approxim ation to the distribution of the number of vertices of 
degree d in the classical Erdos-Renyi (119591 ) random graph G„ having n vertices connected 
by independent edges with common success probability depending on n and a parameter 
6. Over the range of parameters considered t he theorem yie l ds a bound that is the same 
up to constants as the one obtained earlier by iBarbour et al.l (119891 ) for the weaker smooth 
function metric (TT^. 



Stein's method (jSteinl (ll972l ). ISteinl ( 119861 )) often proceeds by coupling a random variable 

Y of interest to a related variable Y' , using, for example, the method of exchang eable pairs, 
size bias couplings, or zero bias couplings; for an overview see IChen et al.l (|2010| ). The chief 
innovation here is the removal of an inconvenient restriction present in a number of results 
that provide Kolmogorov distance bounds using Stein's method, that the difference |y — F'l 
between Y and the coupled Y' be bounded almost surely by a constant. Through the use of 
an unb ounded coupling, in Theorem 12. 11 we are able to extend the previous work by iKordecki 
(Il990l ) on the number of isolated, or degree zero, vertices of G„ to all positive degrees. 

To describe Theorem ll.il our general result, recall that for a nonnegative random variable 

Y with finite, nonzero mean /i we say that Y'^ has the Y-size bias distribution if 



E[Yf(Y)] = fiE[f(Y'^)] for all functions / for which these expectations exist. 



:i) 
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In emp loyi ng; the size bi a s vers ion of Stein's method (see lBaldi et al.l ( 119891 ) , iGoldstein and Rinott 
(Il996l ) and I Chen et ahl (I2OIOI )). the goal is to construct, on the same space as Y, a variable 
y* with the Y-size bias distribution such that Y and Y^ are close is some sense. Previous 
applications of the size bias coupling technique for obtaining Berry-E s seen bounds by Stein's 



(120051 ) . IGoldstein and Penrose 



metho d, re quiring that IF'^— y| be bounded, include IGoldstein 
(l2010h and IGoldstein and Zhan^ (l201lh . 

Let N = {1,2,...} and Nq = N U {0}. Our abstract framework consists of random 
elements indexed by n > no for some tiq G Nq whose distributions Cg{-) depend on n, left 
implicit when clear from context, and a parameter ^ in a topological space 0„. In our 
particular application the parameter 6 lies in a subset On of the real numbers M and interest 
centers on the distributions of the nonnegative random variables Yn counting the number of 
degree d E N vertices of the Erdos-Renyi random graph G„. 

In general, when Yn is a given sum of nontrivial exchangeable indicators. Lemma 11.11 
below says, essentially, that to construct a variable Y^ with the F„-size bias distribution, 
one chooses an indicator uniformly and sets it to one if it was not so already, and then 
'adjusts' the remaining indicators, if necessary, to have their original distribution given that 
the selected indicator is one. Applying Lem ma II. II when Y„ count s the number of vertices of 
degree d in Gn results in the construction of iBarbour et al.l ( 1l992l ). where nothing is changed 
if a uniformly chosen vertex already has degree d, and otherwise edges to the chosen vertex 
are added if the vertex has degree less than d, or removed if it has degree in excess of d. As 
it is possible that the chosen vertex has, say, n — 1 edges, the resulting coupling fails to be 
bounded in n. Nevertheless, when there is only a small probability that a very large number 
of edges will need to be added or removed, the coupling can be controlled using moments on 
bounds Kn that satisfy IF^** — F„| < i^„. 

After coupling, the second ingredient in our method has an inductive flavor. We construct 
a variable Vn such that its distribution, conditional on a collection J„ of random elements, 
is that of Yn, reduced in size by some 'small' amount Ln, with parameter ipn,9 'close' to the 
original 6. Formally, we require that 



^e{Vn\Jn) — C,^„^0iXn-Lr, 



(2) 



hold on an event where the size of L„ is controlled, and that a bound Bn on the absolute 
difference |F„ — Vn\ not be 'too large.' As bounds to the normal for y„ can be expressed 
in terms of quantities that include bounds to the normal for reduced versions of the same 
problem, a recursive inequality for the sought after bound can be produced. 

In the graph degree problem Vn counts the number of degree d vertices in the graph 
obtained by removing a uniformly chosen vertex from Gn-, along with all its incident edges, 
and the set J„ consists of the identity of the chosen vertex, and its degree. Conditional on 
Jn the graph that remains is an Erdos-Renyi graph on the reduced vertex set, with the same 
connectivity as before. As with the bound Kn, it is not required that Bn be almost surely 
bounded by a constant; though |y„ — Vn\ may be large in the graph degree problem, it is 
unlikely that it will be. 

Tension exists in choosing the set J„ that appears in the conditioning equality (|2]). In 
order to reduce the larger problem to a smaller one so that induction may be applied, 
working conditionally we must be able to treat the bounds Kn and Bn, and the parameters 
of the reduced problem, Ln and ipn,e, as constants. Hence we require that these variables be 
measurable with respect to J^n, the a-algebra generated by the conditioning collection J„. 



Though this restriction necessitates that J-'n be large enough to contain, say, information on 
Y^ — Yn, it must also be small enough so that, say, L„ and B^ are not too large, and that the 
conditioning 'leaves enough randomness' to yield a useful recursion for the ultimate bound. 
At the heart of our main result, and Stein's method for normal approximation, is the 
characterization that Z is a standard normal random variable if and only if 

E[Zf{Z)] = E[f{Z)] 

for all absolutely continuous functions / for which the above expectations exist. This char- 
acterization leads to the Stein equation, when, given a test function h on which to evaluate 
the difference Eh{W) — Eh{Z) between the expectation of the random variable W of interest 
and the standard normal Z, one solves 

f'{w) - wf{w) = h{w) - Eh{Z) 

for /. Using /, one evaluates this difference by substituting W for w, and takes expectation 
on the left hand side, rather than the right. Though we focus on manip ulation of th e Stein 



equation using the size bias coupling, many variations are possible, see IChen et al.l (120101 ) 
for an overview. 

Throughout, for uq eN and all n > tiq and 6^ G ©„ we let ^n,e = EgYn and cXng = Var5i(y„) 
indicate the mean and variance of y„ under Cg. The value r^^e appearing in Theorem II. II is 
a function that determines the quality of the bound to the normal, while the sequence Sn,9 is 
used to control L„, and hence the size of the smaller subproblem Vn related to F„. Without 
further mention, /x„e,cr^g and r„e are assumed to be measurable in 6' G 9„, a condition 
satisfied for all natural examples, including the one considered here. To avoid repetition, the 
distribution of random variables indicated after 9 E On has been fixed is with respect to Ce- 
The random variable Z will always denote the standard normal. 

To familiarize the reader with the conditions of Theorem II. 1| towards the end of this 
section we present its application in the simple case where a bounded size bias coupling of 
Y^ to Yn exists. 

Theorem 1.1 For some uq G Nq and all n > uq and 9 G 0„, let Y^ he a nonnegative 
random variable with mean fin,9 and positive variance u'^q, and set 

Wn,e = ^:^i^i^, (3) 

the standardized value of Y^. Let r^^e be positive for all n > Hq and 9 E Qn, o^i^d for all 
r > 0, letting 

Qn,r = {9 eQn- Tnfi > r}, 

assume there exists ri > and ui > uq such that 

max sup r„e < oo. (4) 

Further, suppose that for all n > rii and 9 G 6n,ri) there exist random variables Y^,Kn,Ln, 
ipn^e, Vn and Bn on the same space as Yn, and a a-algebra Tn, generated by a collection of 
random elements Jn, such that the following conditions hold. 



1. The random variable Y^ has the Yn-size bias distribution, and 

"^nfi = a/ Fare {Eg(Y^ - F„|F„)) satisfies sup — — ^ — — < oo. (5) 

2. The random variable K^ is J^n-^^^o-surable, \Y^ — Yn\ < Kn, and 

sup — ■ ■ 3 ■ < OO, (6) 

n>ni,eee„,rj^ '^n,e 

with Wnfi as given in ^. 

3. The random variable L„ takes values in {0,1,..., n}, there exists a positive integer 
valued sequence {snfi}n>ni satisfying n — Sn,e > "^o; the variables L^ and ipn,e are 
J^n-measurable, there exists Fnfi G J-'„, Fn^ C {L„ < Sn,e} such that 

i^nfi^Qn-Lr, and C0{Vn\Jn) = C^^A^ri-L-r?) On F^fi (7) 

and 

sup '^^^Ee [Klil - 1(F„,,))] < oo. (8) 



4- There exists {ci,C2} C (0, oo) such that 

5. The random variable B^ is J'n-T^Gasurable, \Yn — Vn\ < Bn and 



(^le < cicrl.r^. and r„,e < C2r„_i„,^„ „ on F^ 



rlgfinfiEelKlB,,] 
sup — ■ ^ < oo. (9j 

n>ni,eeOn^ri ^n,e 

6. Either 

(a) there exists Infi G Nq such that PelLn = /„,o) = 1 for all 9 G 0„,,ri 
or 

(b) the set 0„,r-i is a compact subset of Qn, and the functions of 9 



E,[J^l(L„^l)) 



i„,w = Bo ( ;^l(in = /) j , le{0,l,...,n} (10) 

are continuous on <dn,ri for / G {0, 1, . . . , s„} where s„ = sup^gQ^ ^ Sn,e- 
Then there exists a constant C such that for all n > uq and 9 E Qn 

sup \Pe{Wn.e < z) - P{Z < z)\ < C/rn,e. (11) 



When higher moments exist a number of the conditions of the theorem may be verified 
using standard inequahties. In particular, by the Cauchy-Schwarz inequahty a sufficient 
condition for (El) is 



,1/2 

sup 5 < oo where k. 



and, when F„ g = {L^fi < Sn^} a sufficient condition for ([8]) is 

1 1 



771 Ty^m 

n,tl,m — ^eJ^n i 



sup 



< OO where I 



^nfi^nfi 



771 jva 



since, additionally using the Markov inequality yields 



(12) 



1,2 72 
Similarly, a sufficient condition for (|9]) is 



1 1 

^n,9A''",S^n,6»,4"n,0,2 , , 

sup J < OO where o„ 






(13) 



A general prescription for size biasing a sum of nonnegative variables is given in lGoldstein and Rinott 



(Il996l ): specializing to exchangeable indicators yields the following result. 



Lemma 1.1 LetY = XlaeX"^" ^^ a finite sum of nontrivial exchangeable Bernoulli variables 
{Xa, OL G X}, and suppose that for a G X the variables {Xa, /3 G X} have joint distribution 



Then 



CiXl /3 G X) = £(X^, /3 G X|X, = 1). 






has the Y-size biased distribution Y^ , as does the mixture Y^ when I is a random index with 
values in I, independent of all other variables. 

Proof: First, fixing a G X, we show that F" satisfies ([T]). For given /, 

E[Yf{Y)] = J2E[X,f{Y)] =J2p[X, = l]E[f{Y)\X, = 1]. 

As exchangeability implies that E[f{Y)\Xi3 = 1] does not depend on /3, we have 

E[YfiY)] = (J2p[X, = 1] ) E[fiY)\X^ = 1] = E[Y]E[f{Y^)], 
\/3ex / 

demonstrating the first result. The second follows easily using that Y^ is a mixture of random 
variables all of which have distribution y . D 



Employing size bias couplings and Stein's method, I Chen and RoUinl ( 120101 ) prove a gen- 
eral result to compute bounds t o the normal in the Was erstein metric. In particular, Corol- 
lary 2.2 and Construction 3 A of IChen and RoUinl (120101 ) yield 



dw{Ce{Wnfi),C{Z))<{).% 



l^nfi^nfi . fJ"n,ekn,e,2 



+ 



a. 



nfi 



a, 



(14) 



n,e 



To compare (IT^ with one conclusion of Theorem 11.11 as well as to familiarize the reader 
with the roles of some of the variables appearing in its formulation, we now consider its 
application in the simple case where a bounded size bias coupling exists, that is, when the 
bound Kn on |F^ — y^| can be taken to be a constant, say /c„, almost surely. In such cases 
we set Jn to be the empty set, and note that any constant is measurable with respect to 
the trivial cr-algebra that Jn generates. Conditions [3] through are easily satisfied in this 
case. In particular, taking Ln = 0,Sn,e = 1 and Fn^g = {L„ < Sn,e}, with Jn = 0, dZ]) of 
Condition |3] holds with ipn,9 = 6 and Vn = Yn, and (|8]) holds as 1 — l{Fn,g) = a.s. As 
(n — Ln, 4'n,e) = {n, 9), Condition |4] holds with ci = C2 = 1. As Vn = Yn we may take i?„ = 
in Condition and as L„ = Conditioning is satisfied. Hence, only Conditions |T] and |2] are 
in force, and Theorem 11.11 obtains with 

-1 /^n,6»^n,0 , fJ'nfikn 



+ 



<JZ 



o"; 



yielding a Kolmogorov bound that, up to constants, agrees with the Wasserstein bound (fT4ll 
in this particular case. 

Bounded size bias couplings exist not only when Yn is the sum of independent, bounded 
nonnegative random variables, or a sum of bounded, nonnegative locally d ependent y a riable s 



with bounded dependence neighborhood sizes, as studied, for instance, in iGoldstein 



(120051). 

see 



Bounded size bias coupling s c an also be constructed in ca ses of global dependence 
Goldstein and ZhangI (120111 ) or IGoldstein and Penrose! (120101 ). 

We next apply Theorem 11.11 to vertex degree counts in the Erdos-Renyi random graph. 
The proof of Theorem 11.11 is given in Section |3l 



2 Vertex degree in the Erdos-Renyi random graph 



We apply Theorem 11.11 to bound the error in the normal appr oxima tion to the distribution 
of t he numbe r of ve rtices of a given degree in the Erdos-Renyi (119591 ) random graph G„ (see 
also iBoUobasl . Il985l ) . With n G N we take the vertex set of Gn to be X^ = {1, . . . ,n}, and the 
indicators ^u,v of the presence of edges between distinct vertices u and v to be independent 
Bernoulli variables with a common success probability. No vertex is connected to itself, and 
we set ^u,u = for all u & Xn- 

The number F„ of vertices of degree d of Gn has been the object of much study. For a se- 
quenc e of graphs with connectivity probability p depending on n G N, iKarohski and Rucihski 
( 119871 ) proved the asymptotic normality of Yn when n^'^^^ ^^'^p -> oo and np — )• 0, or np — )■ oo 



and np — \ogn — dloglogra — > — oo; see also iPalkal (Il984f) and iBoUobasI (119851 ). A symptotic 
norma lity of Yn when np ^ c > 0, was obtained by iBarbour et al.l (19891). and iKordecki 



1990) for non-smooth functions of Yn in the case d = 0. iNeammanee and Suntadkarn 



( 20091 ) obtain a Kolmogorov distance bound between Yn and the normal with rate n ^/^+^ 



for all e > when Var(y„) is of order n . Other univariate results o n asymptotic normality 
of counts on random graphs are given in lJanson and Nowickil (jl99ll ). and references therein. 
Goldstein and RinottI (119961 ) obtain smooth function bounds for the vector whose k compo- 
nents count the number of vertices of fixed degrees (ii, d2, . . ■ ,dk when p = /{n — 1) G (0, 1) 
for fixed 9, implying asymptotic multivariate joint normality. 

We focus on the counts of vertices of some fixed degr ee d &'H, the case c? = of isolated 
vertices having already been handled by iKordeckil ( 19901 ). Set 



e„ = (0,n-l)n(0,6] foralln>rf + l 



(15) 



with h some arbitrarily large constant, and let the connectivity probability between the 
vertices of G„ be given by 6 /{n — 1) for n > rf + 1, 6* G 9„. For w G X„ let 

Dn{v) = ^ ^v,w, Xn,v = 1(-Dn(w) = d) and Yn=^ Xn,v, 



wGX„ 



ueXn 



the degree of vertex v, the indicator that vertex v has degree d, and the number of vertices 
of degree d of G„, respectively. 



From iGoldstein and RinottI (119961 ). for all n > c? + 1 and 6 G G^, the mean /i„ e and 
variance a"^ q of F„ are given explicitly by 



^nfi = nTn 



and a, 



n,ti 



nT„ 



{d-ey 



where 



r„ 



n — 1 
d 



n 



1 



e{i-e/{n-i)) 
e 



- 1 



+ nTn 



n 



1 



n—l—d 



(16) 



(17) 



Theorem 2.1 For any d G N and b > there exists a constant C such that for all n > d + 1 
and all 6 E Qn given in ( 173]) . the normalized count Wn,e given by ^, of the number ¥„, 
of vertices with degree d in the Erdos-Renyi random graph Gn on n vertices, with edges 
connecting each distinct pair independently with probability Qjin — 1), satisfies 

sup \P0{Wn,e < z) - P{Z <z)\< Cjr^^Q for alln>d + l, 

where Z is a standard normal variable and 

rn,0 = ^JrvFe with Tg = e'^O'^/dl (18) 

In the metric di over Lipschitz functions h defined by 

\Eh{X) - Eh{Y)\ 



h 



\h\\ + \\h' 



(19) 



by applying Stein's method iBarbour et al.l ( 119891 ) obtain a bound of order \j ^ffUf^. As 
Lemma [2.11 shows that Tnfi/re converges uniformly to 1 over G„, the Kolmogorov bound of 
order 1/ y/uTp provide d by Theorem 12. II is of the same order as the d^ bound. As remarked in 






Barbour et al.l (119891 ). a bound of size e„ in the di metric yields a bound in the Kolmogorov 



metric of order 0(e„ ), which can at times be improved to 0(e„) 'at the cost of much greater 
effort.' 

Though we do not cover the case d = oi isolated vertices, handled in iKordeckil ( 1l990l ). 



our proof can be extended to apply there by appending additional arguments that are sep- 
arate, but similar to, those for the case d & N. Note, for example, the difference in the 
behavior of the function Tg at zero for these two ranges of d. 

Following Lemma 11.11 for the case of vertex degrees yields a coupling where for each 
n > d + 1 and vertex f G X„ one constructs a graph GJ^ from G„ having the distribution 
of Gn conditione d on Xn,,, = 1, or equ ivale ntly, on Dn(v) = d; th i s coup hng has previously 
been applied by iBarbour et al.l ( 119921 ) and iGoldstein and RinottI (Il996[ ). The graph C^ is 
obtained from Gn by adding or removing edges of v as needed. Mixing over v as indicated 
by Lemma 11.11 yields a variable Y^ having the y„-size bias distribution. 

In the course of constructing G^ one also obtains a set TZ^ holding the collection of 
vertices other than v that are affected by the size bias operation. In particular, if Dn{v) = d 
then G^ = Gn and TZ!^ = 0- If -D„(f) > d then Gn is formed by removing from G„ the 
edges between v and the vertices in the subset TZ^ of neighbors {u : C,u,v = 1} oi v, chosen 
with uniform conditional distribution given Gn over all subsets of the neighbors of v of size 
Dniy) — d. Similarly, if Dniv) < d then G'"^ is formed by adding edges to Gn between v and 
vertices in 7?.^, chosen with uniform conditional distribution given Gn over all subsets of the 
non-neighbors {u : u ^ v, ^u,v = 0} of f of size d — Dn{v). 

Now let X^ ^ be the indicator that vertex w has degree d in Gn and 



n 



EX" 
n,wi 



the number of degree d vertices in G^. When /„ is chosen uniformly over X„, independent 
of all other variables. Lemma 11.11 yields that Y^ = Y^" has the y„-size biased distribution. 
Similarly setting 7^* = 7^^", all vertices not in {/„} U 7^^ have the same degree in both Gn 
and G* , and as /„ ^ 7?.* , letting 



An = {In} U n'n WC haVC \An\ = 1 + \d - D„(/„)|. 



(20) 



We prove Theorem 12.11 by verifying the hypotheses of Theorem 11.11 for the size bias 
construction just given. With r„^e as in ( !T7|) . and recalling (TT6|) . let 



Sua 



Tn 



{d-6f 



-1 



_^(l-^/(n-l)) 
and correspondingly, with tq as in ( TTSl) . let 

\d-ef 



+ 1 so that 0"^ 



nTnftSnfi, 



(21) 



5e = Tg 



e 



1 



+ 1. 



(22) 



With the help of a technical lemma placed at the end of this section, we present the proof 
of Theorem 12.11 Throughout we let Cj denote a constant not depending on n or 6', and not 
necessarily the same at each occurrence. 

Proof: Let tlq = d + 1. For n > Uq and 6 G 0„ the binomial and Poisson probabilities r„ ^ 
and Tg in flT7|l and flTSl) . respectively, lie in (0, 1), and hence a'^g of flT6|) and r„e are positive 



8 



for all such n and 9. Let ri > be arbitrary. In place of naming ni explicitly, we show the 
remaining conditions of Theorem II .11 are satisfied for all n sufficiently large. Since r„ g < ^Jn 
inequalit y (|4|) holds f o r any rii > hq. 

From IChen et al.l (J2ninl . (12.17)), following lOoldstein and Rinottl ( 1l996l ). for Y^ having 
the 1^-size biased distribution as constructed above, we obtain 

^n e < CiTi-^ (246 + 48^2 + ^44^3 ^ 43^2 ^ ^44^^2 ^ ^2) and hence sup ^„ ^ < ^• 

eee„ ' V^ 



To complete the verification of Condition [H Lemma 12.11 gives that over B„ the ratio 
^el^nfi = SefJ'n,9/o'ng couvcrges uniformly to 1, and 6g in ( 122|) is bounded away from zero. 
Hence for all n sufficiently large and all 6 G 0^ we have 



f^n,e 



0": 



Ob 



and so ?; < C^^/Te < C4, 



a. 



(23) 



n,e 



as Te < 1 for all 6 G G„. 

Turning to the Condition |2l let 

J„ = {/„,£>„(/„)} and j; = or{J„}, 

that is J^n is the a-algebra generated by the chosen vertex and its degree. Further, let 



Clearly Kn is J-'„- measurable, and recalling that vertices not in An of fl20|) have the same 
degree in both Gn and G^, taking the difference between Y^ and F„ yields 

Yn -Yn= Yl (^«:- - ^".-)' a^d (EDD yields 11;^ -Y^\=l + \d- D{In)\ < Kn. 

W&An 



Next, for all m G N we have 

K^<2^-\{l + dr + Dn{In) 



(24) 



To bound the moments of Kn, using iRiordaru ( 119371 ) for the first equality below, with 
Sj^rn the Stirling numbers of the second kind and {n)j the falling factorial, with C^^m = 
mmaxi<j<m Sj^m and D ~ Bin(n — l,p) we obtain 



ED' 






[n 



l),y < Y. ^^^rn{n - 1)V < C,,m{{n - l)p + (u - 1)>"^). 



In particular EoDniv)'^ < C^^mip + 6™), and as Dn{In) is the mixture of the identical 
distributions Dn{v) over t> G X„, it obeys the same upper bound. Taking expectation in ([24 
we find that there exists constants Cq^,,^^ ^^ such that 



kn,e,m. < C^^m for all n G N, 6* G 6„ and m &'H. 



(25) 



Now, using (l25l) for the first inequality in (|26|) . the first inequality in (|23|) for the second 
inequality, the second equality of ( 12T]) for the first equality, and Lemma 12.11 both to obtain 



9 



the third inequahty, and the boundedness of 5g away from zero for the fourth, we obtain 
that for all n sufficiently large and 6 G 0„, 



.1/2 



ef^n,eK,e,A , Cfi//'^rn,efJ'n,e . Cjrn^ _ C-j^q Q 



8 



^ -0.^ ■n,.^^n,. ^ ^^rn^ ^ -,y. < ^ < ^^^ (36) 



Hence inequality ( 1T2|) . sufficient for (|6]), is satisfied, and Condition |2] holds. 
Turning to Condition |3l for n > rf + 2 let 

Ln = 1, Sn,9 = 1 'ipn,e = f _ ^ j and Fn,e = {Ln < Sn,e}, (27) 

and note therefore that conditions holding on Fn^e niust hold on the entire probability space. 
Clearly L„ takes values in {0, 1, ... , n} as required and n — Sn,e > no for any n > d + 2. 
Being constants, L„ and ifjnfi are J^n measurable, hence Fnfi G Jvj. By ( 127|) and 6* G 6„ we 
have that iljnfi ^ (0, &] fl (0, n — 2) = 6„_i = 0n-L„, verifying the first part of ([7]). 

Regarding the second part of (I?!), let Hn be the graph (?„ with the vertex /„ and its 
incident edges removed, relabeling the remaining vertices {l,...,n — l}by preserving their 
relative order. Let Vn be the number of degree d vertices of Hn- By counting the number of 
degree d vertices, the distributional equality in ([7]) is a consequence of 

^e{Hn\In, Dn{In)) = C^^g{Gn-l)- (28) 

The graph if„ is determined by {iu,v '■ {w,f } C X„ \ {/n}}, which is independent of the a- 
algebra generated by {/„, ^/„,i,, v G X„}, with respect to which /„ and Dn{In) are measurable. 
Hence Hn is independent of the conditioning event in fl28l) . and therefore its conditional 
and unconditional distribution agree. In particular, conditional on {/„,!)„(/„)} the edge 
indicators of Hn are independent with common success probability 

i^nfi 



n — 1 n — 2 

so (1281) holds. The inequality (jH]) holds trivially, as P(L„ > 1) = 0. Hence Condition [3] holds. 

By Lemma [2.11 Condition H] holds with ci = C2 = 2. 

Regarding Condition [5|, as only the degrees of vertex /„ and its neighbors are different in 
the graphs G„ and if„, we have 

\Yn-Vn\<l+DiI„)<Kn, 

and we set i?„ = Kn, so J-'„-measurable. We now finish the verification of Condition O by 
showing flT3l) . sufficient for IQ, is satisfied. By fl25|) . that yU„ ^ = ^Tn^ and the second equality 
in fl2T|) . for all n sufficiently large and all 6 G 0„ we have 

1 1 

^ - r^^2 ^ "^10, 

where the final inequality follows from Lemma [2A1 yielding that TnfiJTQ and 5n,e/5e converge 
uniformly to 1 on 6„, and that 5e is bounded away from zero on (0, h\. 

Lastly, Conditioning holds with /„o = 1 for all n > c? + 2, completing the verification of 
all conditions of Theorem II. 1[ D 

The proof of Lemma 12.11 is straightforward, and is therefore omitted. 
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Lemma 2.1 With Tnfi,T0,6nM and 60 given by [T7\ ), /[T^) . I[21\) and (d^ respectively, for all 
d G N and all b > the function 60 is bounded away from zero and infinity over (0, b], and 
the ratios 



Tn,e Sn,e "Trifi 



and 



a, 



n,e 



r2 



and their reciprocals converge uniformly to 1 on (0, b] as n tends to infinity. 

3 Proof of Theorem 1.1 

We begin the proof of Theorem 11.11 with the following lemma. 

Lemma 3.1 Suppose that for some rii G Nq the nonnegative numbers f, {pn,i}n>ni,o<i<n and 
{an}n>o satisfy 

n n 

Qn < y^ an-iPn,i + f for all u > Hi , and r G (0, 1) where r = sup ^p^,;. (29) 

Then sup^>o'^n < ^^■ 

Proof: As for all n > ni we have Pnfl < t < 1, letting 

(ln,i = — for 1 < / < n and a 



1 - Pn,o 1-1" 

implies 

n oo 

On < / Oj-n-ifln I + a with < N Qn I ^ -^ < T, for all n > Ui. 

' ' 1 — Pn 

1=1 1=1 ^^'" 

Letting a = maxo<„<ni c^n and c = max{a, a{l — r)}, the sequence {bn}n>o defined by 
bn = a for < n < ni and bn+i = Tbn + c ioi n > rii 

has, for n > ni, the explicit form 

_ c <" 

bn = 'Jt"' "^ H , where 7 = a 



1-r ' 1-r 

Since 7 < and r G (0, 1), the sequence {bn}n>o is non-decreasing with limit c/(l — r), and 
hence is bounded. We complete the proof by showing that for all n G Nq we have a^ < &m 
for all < m < n. Clearly the statement holds for < n < rii. Assuming it true for some 
n > ni, using the induction hypotheses, the definition of c and that 6„ is non- decreasing, 

n+l n+1 n+1 

fln+l < X] (^n+l-iqn+l,l + « < ^ K+l-iqn+l,l + C < 6„ ^ qn+l,l + C < rb^ + C = 6„,+i. 
Z=l i=l 1=1 

n 
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The following proof is based on the inductive argument of iBolthausenI ( 19841 ) . 
Proof of Theorem 11.11 With r > recall that Qn,r = {6* G 6„ : r„ ^ > r} and let 



6{n,r) 



sup \P9{Wn,9 < z) - P{Z < z)\ ioTn>no. 
2eK,eee„,r 



(30) 



First note that fITT]) of Theorem 11.11 can be made to hold whenever rn,e < '^i by taking 
C > Ti. By (jlj) the cases hq < n < rii and Tnfi > ti can be handled in this same manner. 
Hence it suffices to show that 



(31) 



5(n, r) < C /r for n>ni and r > ri. 
For z G M and A > let h^^x be the smoothed indicator 

{1 X < z 

I + {z — x)/\ Z < X < z + \ 
x> z + \, 

and let Nh^^x = Ehz^x{Z) with Z a standard normal var iable. Let f (x) b e the unique 
bounded solution to the Stein equation for hz,x{x) (see, e.g. IChen et al.l ( l2010l )) 



hz,x{x) - Nhz,x = f'{x) - xf{x). 



(32) 



Let n > ni,6 G Qn,r for some r > ri and 2; G M and A > 0. Recalling Wn,e 
(Yn — fin,9)/o'n,e, with a slight abuse of notation set 



W 






Substituting Wnfi for x in ( l32l) and taking expectation, and dropping the subscript 9 
when not essential, we obtain 



EeK^xiWn 



Nh 



z,X 



Ee[nWn)-Wnf{Wn)]. 



(33) 



Beginning with the second term on the right hand side of (l33l) . from the definition of Wnfi 
and the size bias relation ([T]) we have 

Ee[Wnf{Wn)] = —Ee[{Yn - ^ln)f{Wn)] = ^Ee (fiW:) - /(iy„)) . 

Taking absolute value in ( l33l) and applying the triangle inequality, we obtain 

\EeK^x{Wn) - NK^xl 

= \Ee[fiWn)-WnfiWn)]\ 

- - ifiw:) - f{w^))] 



Ee 


[fiy 


/in 

(yn 


Ee 


/in 
O-n 


Ee 


/in 


Ee 



<Jr, 



fJ'T. 



f\Wn) - ifm) - /(iy„)) 



CTr, 



{w: - Wr.) nwn) + {w: - Wn)f{Wn) - ifiw:) - /(h^„)) 



(34) 



/in 



0"n 



Ee 



w^-w^ 



\f\w^)-f\Wn^mt 



L^o 
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From the size bias relation ([T]) with f{x) = x we obtain finEe[Y^] = Eg[Y^], and therefore 



EeK - Wn] = Ee 



Y" -Y 

0"„ 



cr„ 



EeXl 



/in 



1 2 '^n 



^nH"n H'r, 



(35) 



Now applyi ng fl35|) and |/'(3;)| < 1 from IChen and Shad (l2004j . Equation (4.6)), see also 
Chen et al.l (I201Q . Lemma 2.5), by conditioning on Wn the first term of fl34l) may be bounded 
by 






Ee 



0"„ 



Eo{-- {W^ - W^ 



Wn f\Wn 



/^r 



< ^^^Ys.TEe{W^-W^\Wn) = ^v^„, (36) 



a, 



recalling the definition of \E'„ in 



Moving now to the second term of (1M|1 . iBolthausenI (119841 . Equation (2.4)) gives 

|/(x)|<l and |x/(x)|<l. 



and combining these inequalities with |/'(x)| < 1 and ( 132|) as in IBolthausenI (11984 Equa- 
tion (2.5)) yields 



\f'{x) - fix + t)| < |t| ( 1 + |a;| + - / 1[,,,+A](a; + ut)du 
Hence, applying the bound \Y^ — F„| < Kn, the second term in (IMl) may be bounded by 



fX. 



Kn/un 



Ee I \t\[l + \Wn\ + T- / l[z,z+X]{Wn + Ut)du dt, 



(37) 



yielding three terms. 

For the first two terms in ( I37p we obtain 



2/i 



'-^Ee I (1 + \Wn\) r''^\dt\ = ^Ee[{l + ^1)^]. 

(^n \ Jo J (^i 



(38) 



Next, as |t| < Kn/cr in the region of integration, we may bound the expectation of the 
remaining term in ( 137|) by 



^E, I A-„ 



K„/ij„ p1 



l[z,z+x]{Wn + ut)dudt 



V 



Clearly, 

llz,z+X]{Wr, + Ut) < (l-l^„J + l[,,,+A](W^n + Wt)lF„,«. 

Substituting (HP]) into (I5U1) . the first term in (HU]) gives rise to the expression 
^Ee (k^ r^^" f\l - lF.,)dudt] = ^Ee [kI{1 - 1^„, 



(39) 



(40) 



(41) 
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Substituting the second term in (BOj) into fl39|) . conditioning on Tn and invoking the Tn 
measurabihty of i^„ and F„ g provided by Conditions [2] and [3], respectively, yields 



J-Kn/^n Jo J 

Kn / / Pf " (^ < iy„ + Mt < ;2 + A) 1^„ ^rfwrft , (42) 

J-Kn/a„ Jo J 



Acr^ 
A(j2 



where P^ " denotes conditional probability with respect to J-'„. To handle the indicator in 
l2|) . note that Condition [3] implies that n — L„ > no on -F„,6i- Hence on Fn^e we may define 

yvnfi = 



'^n-Ln,'tpn,e 

and write 

W^ = (^^^lz^\ w^ + (^^^"j - ( ^"~^""^'"^" ^ := p.W^ + r„,i - r„,2. (43) 

By Conditions \5\ and [3] we have |T„i| < Bn/cn and that Pn,Bn and T„^2 are J^n- 
measurable. Using ( H5]) we may write 

= p/"(p;'(^ - r„,i + r„,2 - Mt) < w^ < p-\z - r„,i + r„,2 -ut + A))ip„ , 

< Pf"(p-i(2;+T„,2-«t)-fi„/(T„_L„,^,„ < W^ < p~\z+T^,2-ut) + Bja^_L„,^^ + p;^'X)lF„^^ 
= P^"{Qn - B^/a.^_L^^^^ <Wn<Qn + B:Jan-L^,^„ + P^'A)!^.^,, (44) 

where we have set 

Qn = p~^{z + Tn,2-Ut). 

Recalling ( 130|) . we have 

Peiz < Wnfi <z + \) 

< \Pe{z < Wnfi <z + \)-P{z<Z<z + X)\+P{z<Z<z + X) 

< 2(5(n,r„,,) + A/v^. (45) 

Since the endpoints of the interval bounding Wn in f l44|) are J-'„- measurable, using Con- 
dition |3] and fHSjl with the appropriate substitutions, the conditional probability in dH]) is 
bounded by 

{25{n - L„,r„_i„,v,„,J + (25„/a„_i„,v,„,e + p;:^A)/y2^j 1^^ , 

< {26{n - Ln,rnfi/c2) + (2^5„/(t„,0 + v/crA)/y27rj If,,,^- 

where we have applied Condition |U and that 5{n,r) is non-increasing in r. As this last 
quantity does not depend on u or t, substitution into fj42|) yields the bound 



2pn,6» J-, 

az:^^ 



7^2 (^25(n - L,,r„,,/C2) + {2^^Bjan,e + v^A)/v^)] !,.„,. (46) 
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Expression f H6|) leads to three terms. By IQ, that Fn^e C {L„ < Sn,e}, and since n — SnM > no 
imphes s„ = sup^^Q^^ Sn^e < n — uq, there exists a positive constant Ci such that the first 
term satisfies 



^^n,e 



Ee [Kl6{n - L„,r„,0/c2)lF„ J 



4/in,6l^n,6l,2 



E'fl 



^(5(n - Ln, rn,e/c2)lF„ 



EeKl 



<^E, 



Ar„, 



^6{n-Ln,rn,e/c2)lF„ 



EaKl 



Ci 



< Y^ 5Z ^(^ ~ ^' ^n,e/c2)tn,e,h (47) 



Ar„fl 



«=o 



where tn,e,h given in ( jTOj) . satisfy 

n 

y^ tn,e,i = 1 for all 61 € G^.r- 



(48) 



;=o 



by 



Dropping the indicator 1f„ g, the sum of the second and third terms of (H6l) are bounded 



4.^^inEe[KlBn] y/2c[^ir, 



'2Ti\ai 



^ol 



EaKi 



(49) 



Collecting terms (jSlD, (EHD, gl]), gT]) and (09]), and letting 



Cn,6»,l 



Cnfia 



fJ'n,e 



\1' 



/^n.e 



(T, 



2 ". 



-Efl 



a, 



nfi 



/2ci 



l + ^)+|W^n,e|]i^, 



and 



—^—Eg [K^{1 - 1f„,JJ H 



0", 



n,6l 



27ra^,e 



for all z G M we have 

\Eeh^^x{Wnfi) - Nh^^xl < t ^ (^(^ - I, rn,e/c2)tnfi,i + c„,e,i + T-c„,e,2- 



Ar. 



n,e 



«=0 



A 



(50) 



Note that Conditions [T] and [21 and [3] and [5l respectively yield the existence of positive 
constants C2 and C3 that 

Cn,e,i < C2/rnfi and c„,0,2 < C'sAn.e- (51) 

As l(w < 2;) < hz^xiw) < l(w < 2 + A) we obtain 

Pe(W^M < ^) - P(^ < ^) < \EeK^x{Wn^0)-EK^x{Z)\+EK^x{Z)-P{Z<z) 

with Ehz.\{Z) — P(Z < z) < P{z < Z < z + \) < A/v27r- Along with a similar lower bound 
obtained by considering hz-\,x{w), in view of (15UI) and (IFIII we have that for every ^ G M 

\Pe{Wn,e <z)-P{Z<z)\<-^y^5{n- I, r^,e/ 02^,9,1 + — + T^ + ^^■ 

^^ " ^^ ^n,e Ar^ g V27r 



/=o 
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Letting A = 2c2Ci/r„^6), and, noting that the right hand side does not depend on z, taking 
supremum over 2; G M yields 



sup \Pe{Wn,e < z) - P{Z < z)\ <^S{n- /,r„^e/c2)t„,0j/2c2 + C^/rn^e 

<Y^S{n- I, r/c2)tn,e,i/2c2 + CJr, (52) 



1=0 



1=0 



for 6*4 = 6*2 + C3/2C2C1 + 2c2Ci/y/2n, where for the last inequality we have used that 
d G On,r, and that 6{n,r) and 1/r are non-increasing functions of r. Taking supremum over 
Qn,ri on the right hand side of (!52|) . then over Qn,r C Qn,ri on the left yields 

S{n, r) < sup ^ 5{n - /, r/c2)tn,e,//2c2 + C^/r. (53) 

eee„,ri ^^^ 

Suppose first that Condition Eb is satisfied, so that L„ = /„^o almost surely for some 
lo,n e No for all 9 G Qn,Ti- If /o,n > Sn then ([10]) and ([53]) yield 5{n,r) < C/^jr, proving (I3T]) . 
Otherwise < /„,o < -s^, and as t„,e,« = 1(/ = /n,o), inequality ([53]) specializes to 

b{n, r) < 5(71 - ln,o, r/c2)/2c2 + C,/r. (54) 



When Condition [6j3 is satisfied, the sum in ([53]) is a continuous function of 6* on the 
compact set On,ri, and hence achieves its supremum on some 6** G On,ri- Letting pn,i = 
tnfi*a/2 using ([IE]) we have 

s„ n 

5{n,r) <^5{n- /, r/c2)p„,i/c2 + C*4/r with ^Pn,i = 1/2. (55) 

1=0 1=0 

As ([54]) is the special case of ([55]) when pn,i = 1(/ = lo)/2, it suffices to handle the latter. 

When the second inequality in Condition JU holds for some C2 it also holds for all larger 
values, and we may therefore assume C2 > 1. Let a„ = for < n < uq, and a„ = 
sup^>,^^ r6{n, r) for n > Hq. For all r > ri and n > ng we have 

(r/c2)(5(n, r/c2) < sup (s/c2)(5(n, S/C2) 

s;s>ri 

= sup s6{n, s) 

s:s>ri/c2 

< sup s6{n, s) + sup sS{n, s) 

s:ri/c2<s<ri s:s>ri 

< ri + a„. 

Using that n > rii implies n — Sn > ^05 multiplication by r in (I55|) yields, with / = 
ri/2 + C*4, that for all n > ni 

rS{n, r) < ^{r/c2)5{n - I, r/c2)pn,i + C*4 < ^(^i + an-i)pn,i + Ci<^^ an-iPn,i + f- 
1=0 1=0 1=0 
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Taking supremum on the left hand side over r > ri and recalhng f l55|) now yields 

n n 

On < ^ an-iPn,i + f with ^Pn,i = 1/2, for all n>ni. 

1=0 1=0 

Lemma [3TT] now implies sup„>„^ a„ < oo. Hence, there exists a constant C such that 6{n, r) < 
C jr for all n > rii and all r > ri, that is, (1311) holds. D 
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